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Abstract 

We describe a general method to obtain quantum speedups of classical algorithms which are 
based on the technique of backtracking, a standard approach for solving constraint satisfaction 
problems (CSPs). Backtracking algorithms explore a tree whose vertices are partial solutions 
to a CSP in an attempt to find a complete solution. Assume there is a classical backtracking 
algorithm which finds a solution to a CSP on n variables, or outputs that none exists, and 
whose corresponding tree contains T vertices, each vertex corresponding to a test of a partial 
solution. Then we show that there is a bounded-error quantum algorithm which completes the 
same task using logn) tests. In particular, this quantum algorithm can be used to 

speed up the DPLL algorithm, which is the basis of many of the most efficient SAT solvers used 
in practice. The quantum algorithm is based on the use of a quantum walk algorithm of Belovs 
to search in the backtracking tree. We also discuss how, for certain distributions on the inputs, 
the algorithm can lead to an exponential reduction in expected runtime. 


1 Introduction 

Grover’s quantum search algorithm [34] is one of the great success stories of quantum computation. 
One important domain to which the algorithm can be applied is the solution of constraint satisfac¬ 
tion problems (CSPs). Consider a constraint satisfaction problem (CSP) expressed as a predicate 
P : [d]"' —)• {true,false}, where [d] = (0,... ,d — 1}. We would like to find an assignment x to 
the n variables such that P{x) is true, or output “not found” if no such x exists. This framework 
encompasses many important problems such as boolean satisfiability and graph colouring. Grover’s 
algorithm solves such a CSP using 0{^/dP) evaluations of P, whereas with no further information 
about P, hnding an x such that P{x) is true requires ^{dP) evaluations classically in the worst 
case. However, when we are faced with an instance of a CSP in practice, we usually have some 
additional information about its structure. For example, P may be defined as the conjunction of 
smaller constraints of a particular type, as in the case of graph colouring. This information often 
allows classical algorithms to solve the CSP significantly more efficiently than the above bound 
would suggest, throwing some doubt on whether straightforward use of Grover’s algorithm will 
really be used to solve CSPs in practice. 

One of the most important and most general classical tools to take advantage of problem 
structure, both in theory and in practice, is backtracking [7]. This technique can be used when 
we have the ability to recognise whether partial solutions to a problem can be extended to full 
solutions. We assume that the predicate P allows us to pass it a partial assignment x of the form 
a: : S' —)■ [d], where S C {1,... ,n}, which specihes the values assigned to the variables in the set 
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Assume that we are given access to a predicate P : T> ^ {true, false, indeterminate}, and a 
heuristic h : P—)• {l,...,n} which returns the next index to branch on from a given partial 
assignment. 

Return bt (*"■), where bt is the following recursive procedure: 
bt(a:): 

1. If P{x) is true, output x and return. 

2. If P{x) is false, or x is a complete assignment, return. 

3. Set j = h{x). 

4. For each u; G [d]: 

(a) Set y to X with the j’th entry replaced with w. 

(b) Call bt(y). 

Algorithm 1: General classical backtracking algorithm 


S. We can equivalently think of x as an element of P := ([d] U {*})"', where the *’s represent the 
positions which are as yet unassigned values. We say that x is complete if it contains no *’s. Then 
P returns “true” if x is a solution to P, “false” if it is clear that x cannot be extended to a solution 
to P, and “indeterminate” otherwise. We say that a partial assignment x is valid if P{x) is true or 
indeterminate, and invalid if P{x) is false. 

Algorithm 1 above describes a generic way to use this information classically. The algorithm 
assumes access to P and a heuristic h{x) which determines how to extend a given partial assignment 
X. We think of P and h as black boxes (“oracles”). The basic idea is to fail early: if we know that a 
partial assignment cannot be extended to a solution, we should give up on it and try a different one. 
We can think of the algorithm as exploring a tree, whose internal vertices are partial solutions to 
P, and whose leaves are solutions to P or certificates that the partial solution cannot be extended 
to a complete solution. This tree is of size at most 0{d^), but for some problem instances could be 
substantially smaller. 

A canonical example of a powerful backtracking algorithm which fits into the framework of 
Algorithm 1 is the DPLL (Davis-Putnam-Logemann-Loveland) algorithm [24, 23] for fe-SAT. This 
algorithm forms the basis of many of the most successful SAT solvers used in practice [25, 41, 32]. 
For many practically relevant problem instances, the algorithm runs more quickly than worst-case 
upper bounds would suggest. Another appealing aspect of this algorithm is that, unlike “local 
search” methods based on random walks or similar ideas, it can sometimes produce efficient proofs 
of unsatisfiability, corresponding to small backtracking trees. 

Algorithm 1 outputs all solutions x such that P{x) is true. While in practice the algorithm 
might be modified to terminate when the first solution is found, here we will assume throughout 
that the entire tree is explored. We assume that P and h can both be evaluated in time poly(n), so 
the most important contribution to the complexity of Algorithm 1 is usually the number of vertices 
in the tree, which can often be exponential in n. To simplify the complexity bounds, we also assume 
throughout that d = 0(1); this is effectively without loss of generality as any predicate with local 
domain size d can be replaced with one which uses 0(log(i) bits to encode each variable. 
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1.1 Results 


We show here that there is a quantum equivalent of Algorithm 1 which can be substantially faster: 

Theorem 1. Let T be an upper bound on the number of vertices in the tree explored by Algorithm 
1. Then for any 0 < <5 < 1 there is a quantum algorithm which, given T, evaluates P and h 
0{\/Tn log(l/(5)) times each, outputs true if there exists x such that P{x) is true, and outputs false 
otherwise. The algorithm uses poly(n) space, 0(1) auxiliary operations per use of P and h, and 
fails with probability at most 6. 

We usually think of T as being exponential in n; in this regime this complexity is a near¬ 
quadratic speedup over the classical algorithm. The algorithm can be modified to find a solution, 
rather than just detect the existence of one, with a small runtime penalty: 

Theorem 2. Let T be the number of vertices in the tree explored by Algorithm 1. Then for any 
0 < (5 < 1 there is a quantum algorithm which evaluates P and h 0{-\/Tn^/‘^\ogn\og{l/5)) times 
each, and outputs x such that P{x) is true, or “not found” if no such x exists. If we are promised 
that there exists a unique xq such that P{xq) is true, there is a quantum algorithm which outputs xq 
using P and h 0{VTn log^ nlog(l/5)) times each. In both cases the algorithm uses poly(n) space, 
0(1) auxiliary operations per use of P and h, and fails with probability at most 6. 

We stress that these results can be applied to any backtracking algorithm which fits into the 
framework of Algorithm 1, whatever the predicate P or the choice of the heuristic h. In particular, 
they can be applied to the DPLL algorithm with the commonly used “unit clause” heuristic. 
Theorems 1 and 2 can also be applied to backtracking algorithms which make use of randomness 
in the heuristic h, by interpreting these algorithms as first fixing a random seed, then using this 
seed as input to a deterministic heuristic h. Observe that the runtime bound of Theorem 2 is 
instance-dependent and, to use it, we do not need to know an upper bound on the runtime T of the 
underlying classical backtracking algorithm. For instances on which the classical algorithm runs 
quickly, the quantum algorithm also runs quickly. 

These algorithms can be leveraged to obtain an exponential separation between average quantum 
and classical runtimes. The speedup for any given instance is approximately quadratic. However, 
given the right distribution on the input instances, this can be amplified to an exponential separa¬ 
tion. This is discussed further in Section 4. 

1.2 Techniques 

The algorithms which achieve the bounds of Theorems 1 and 2 are based on the use of a discrete¬ 
time quantum walk to find a marked vertex within the tree produced by the classical backtracking 
algorithm, corresponding to a partial solution x such that P{x) is true. Quantum walks have 
become a basic tool in quantum algorithm design [16, 2, 51, 43]. In particular, they have been 
applied in several contexts to solve search problems on graphs [49, 51, 43, 39], sometimes achieving 
up to a quadratic speedup over classical algorithms. However, in prior work it is usually assumed 
that the input graph is known in advance, and moreover that the initial state of the quantum 
walk is the stationary distribution of the corresponding random walk. Aaronson and Ambainis [1] 
described a different approach to spatial search on graphs; this does not use a quantum walk, but 
also assumes the input graph is known in advance. 

Here we would like to use quantum walks in a context where the input graph is defined implicitly 
by the backtracking algorithm and hence is not known in advance, and where the walk starts at 
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the root of the tree. One of the few cases where such walks have been studied is beautiful work 
of Belovs [8, 9]. The main result of that work relates the complexity of detecting a marked vertex 
by quantum walk on a graph to the effective resistanee of the graph. Informally, this quantity is 
determined by thinking of the graph as an electrical circuit and calculating the resistance between 
the initial vertex and the set of marked vertices. Belovs’ result can be seen as a quantum variant of 
previous classical work characterising properties of random walks on graphs (such as the commute 
time and cover time) in terms of effective resistance [15]. 

The main quantum subroutine used here is just the special case of Belovs’ result where the 
underlying graph is a tree, for which we include a slightly more concise correctness proof. We are 
also able to extend Belovs’ work to give an algorithm for finding a marked vertex in a tree, rather 
than just detecting one. This can easily be achieved using binary search; in the case where there 
is promised to be a unique marked element, we give a more efficient algorithm based on analysing 
eigenvectors of the quantum walk. 

Once we have the quantum search algorithm, all that remains is to check the claim that the P 
and h functions can indeed be used to implement the required quantum walk operations, namely 
mixing across the neighbours of a vertex in the tree, dependent on whether the vertex is marked. 
To do this one has to be careful to ensure that the quantum walk steps are implemented efficiently. 

1.3 Other prior work 

Backtracking is a fundamental technique in computer science and has been studied since at least 
the 1960s. The classical literature on this topic is too vast to summarise here; see [38, 29, 7] for 
introductions to the topic and historical overviews. Cerf, Grover and Williams attempted to find 
a direct quantum speedup of backtracking algorithms [14]. The algorithm of [14] is based on a 
nested version of Grover search. The complete tree of partial assignments is expanded to a certain 
depth, then quantum search is performed within the subspace of partial assignments which have 
not yet been ruled out. The complexity of the algorithm depends on the number of valid partial 
assignments at this depth. It is argued in [14] that, for some reasonable distributions on random 
CSPs, the average complexity of the quantum algorithm (over the distribution on instances) will 
be smaller than would be obtained from Grover search. By contrast, the bounds of Theorems 1 
and 2 hold in the worst case and are applicable to arbitrary backtracking algorithms: if a faster 
backtracking algorithm is found, we immediately obtain a faster quantum algorithm. 

The algorithm used here can be seen as an extreme version of the nested search strategy of [14]. 
The diffusion operation used in the quantum walk can be viewed as applying Grover search within a 
subspace spanned by a vertex in the tree and its children. The algorithm repeatedly performs these 
searches across many vertices and levels simultaneously. On the other hand, the algorithm of [14] 
can be seen as accelerating a restricted classical backtracking algorithm which uses a predicate P 
which is only capable of detecting whether partial assignments at a particular level are false. 

Similarly to the present work, Farhi and Gutmann [28] have studied the use of quantum walks 
to speed up classical backtracking algorithms by searching within the backtracking tree. These 
authors showed that there are some trees for which continuous-time quantum walks can be used 
to find a marked vertex exponentially faster than a classical random walk. The special structure 
of these trees leads to interference effects which enable the quantum walk to penetrate the tree 
more quickly than the random walk. However, for the examples presented in [28] where there is an 
exponential speedup of this form, the structure of the tree enables an alternative classical algorithm 
to also find a marked vertex efficiently. Here, we seek to accelerate classical search in arbitrary 
trees, with no prior assumptions about the structure of the tree. 
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A related, but different, approach towards quantum speedup of recursive classical algorithms 
was proposed by Fiirer [30]. Imagine we have a constraint satisfaction problem for which we can 
put a non-trivial upper bound L on the number of leaves in the computation tree of a recursive 
classical algorithm for solving the problem. The idea of [30] was to apply Grover search over the 
leaves of the computation tree to find a solution in time 0(\/Lpoly(n)). This approach relies on 
knowing, in advance, an efficiently computable mapping associating each integer between 1 and L 
with a leaf. For many more complicated recursive algorithms we may not know such a mapping. 
Indeed, there is some evidence that it may not be possible to compute such a mapping for general 
backtracking algorithms in polynomial time [50]. The quantum algorithm presented here, on the 
other hand, can be applied to any classical backtracking algorithm, even if we do not know a bound 
on L in advance. 

A somewhat similar idea to Fiirer’s was previously used by Angelsmark, Dahllof and Jonsson [5] 
to obtain quantum speedups for CSPs. These authors observed that, for certain CSPs, one can 
construct a set of easily checked certificates, for some c < 1, such that the existence of a 
solution to the CSP is certified by at least one certificate. Then Grover search can be used to find 
a certificate, if one exists, in time poly(n)). 

An alternative, and simpler, approach to find quantum speedups of classical algorithms for 
CSPs is the use of amplitude amplification [12]. This can be applied to any classical algorithm 
which can be expressed as repeatedly running a randomised subroutine which runs in time poly(n) 
and finds a solution with probability p. The corresponding quantum algorithm has a runtime of 
0((l/y^) poly(n)), a near-quadratic improvement on the classical 0((l/p) poly(n)) if p is small. 
For example, it was observed by Ambainis [2] that Schoning’s efficient randomised algorithm for 
fe-SAT [47] can be accelerated in this way; Dantsin, Kreinovich and Wolpert [22] gave several other 
examples. Deterministic backtracking algorithms are, of course, not amenable to this approach. 

Finally, a completely different technique for solving CSPs is the quantum adiabatic algo¬ 
rithm [27]. Although there is some numerical evidence that this algorithm may outperform classical 
algorithms for CSPs [26], the adiabatic algorithm’s runtime is hard to analyse for large input sizes 
and there is as yet no analytical proof of its superiority over classical algorithms. 

Quantum walks on trees have been used previously in a quite different context, to obtain a 
near-quadratic speedup for evaluation of AND-OR formulae [3]. In that algorithm the structure 
of the formula (which is known in advance) defines the tree on which the walk takes place. It is 
interesting to note that the quantum walk used in [3] is similar to the quantum walk used here, but 
has apparently quite different properties. Another case in which the concept of effective resistance 
was used in quantum computing is work by Wang, which gave an efficient quantum algorithm for 
approximating effective resistances [53]. This uses some similar ideas to the present work but does 
not seem directly applicable. 

1.4 Organisation 

We begin in Section 2 by describing the main underlying quantum ingredient, the use of a quantum 
walk to detect a marked vertex in a tree. This algorithm is a special case of an algorithm described 
by Belovs [8]. We then go on in Sections 2.1 and 2.2 to describe extensions to this algorithm to allow 
finding a marked vertex, and a faster runtime in the case where we know there is a unique marked 
vertex. Section 3 shows that the algorithm can be applied to accelerate backtracking algorithms for 
CSPs. Section 4 discusses how to use the algorithm to obtain exponential reductions in expected 
runtime, while Section 5 concludes with a discussion of some ways in which the algorithm could be 
improved, and barriers to doing so. 
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1.5 Preliminaries 


We will need the following tools, which have been used many times elsewhere in quantum algorithm 
design: 

Lemma 3 (Effective spectral gap lemma [40]). Let Ha o,nd Hb he projectors on the same Hilbert 
space, and set Ra = 2IIa — I, Rb = ^Hb — I- Let he the projector onto the span of the 
eigenvectors of RbRa with eigenvalues sueh that \6\ < x- Then, for any vector [?/;) such that 
n^lV’) = have 

\\p^HBm<x\\m- 

Theorem 4 (Phase estimation [17, 37]). For every integer s > 1, and every unitary U on m qubits, 
there exists a uniformly generated quantum eircuit C such that C acts on m + s qubits and: 

1. C uses the controlled-U operator O(2^) times, and eontains O(s^) other gates. 

2. For every eigenvector \^p) of U with eigenvalue 1, C'j?/:)j0®) = j'i/’)|0^). 

3. If U\'il}) = where 9 G (0,7r), then C\'ijj)\0^) = \f:)\uj), where jw) satisfies j(a;j0'^)j^ = 

sin2(2^6l)/(22® sin^ 9). 

4- For any \(j)) G (C^)®™', expanded as \(f)) = Xlfc where jV’fc) is an eigenvector of U with 

eigenvalue then 

cm‘} = Y: Afc|V'fc)|wfc), 

k 

where = C>(l/(2^e)). 

Call 2~^ the precision of the circuit. 

Phase estimation is normally used to estimate eigenvalues of U (hence its name); here, however, 
similarly to [43] we will only need to apply it to distinguish the eigenvalue 1 from other eigenvalues. 
If the smallest nonzero phase is e, this can be done with 0(l/e) uses of controlled-17. 

Fact 5 (Close states and measurement outcomes, e.g. [11]). Let |V’i); IV’ 2 ) be quantum states 
satisfying |||V'i) — IV’ 2 )|| = Then the total variation distance between the two distributions on 
measurement outcomes obtained by measuring each state in the computational basis is at most e. 

(This fact is usually presented with e replaced with 4e [11]; the tighter constant stated here can 
easily be obtained by relating the fidelity of \^pl) and [^^ 2 ) to their trace distance, for example.) 


2 Quantum walks on trees 

We now describe a quantum algorithm for detecting a marked vertex in a tree. The algorithm 
is a special case of a beautiful connection between quantum walks and electrical circuits due to 
Belovs [8] (see also [9]), which is a quantum analogue of a similar connection between random 
walks and electrical circuits [15]. This is conceptually elegant and leads to a very concise proof of 
a previous result of Szegedy [51] on detecting marked elements using a quantum walk. Here we 
only use these ideas for the special case of trees and a quantum walk starting at the root. This will 
enable us to simplify some notation and, hopefully, make the algorithm more intuitive. 


6 


Input: Operators Ra, Rb, a failure probability 5, upper bounds on the depth n and the 
number of vertices T. Let /3 ,7 > 0 be universal constants to be determined. 

1. Repeat the following subroutine K = [ 7 log(l/( 5 )] times: 

(a) Apply phase estimation to the operator RbRa with precision fi/y/Tn. 

(b) If the eigenvalue is 1, accept; otherwise, reject. 

2. If the number of acceptances is at least 3K/8, return “marked vertex exists”; otherwise, 
return “no marked vertex”. 

Algorithm 2: Detecting a marked vertex 


Consider a rooted tree with T vertices, labelled r, 1,... ,r — 1, with vertex r being the root, 
where the distance from the root to any leaf is at most n. Assume for simplicity in what follows 
that the root is promised not to be marked. For each vertex x, let i{x) be the distance of x from 
the root. We assume throughout that, although we do not necessarily know the structure of T in 
advance, we can determine (.{x) for any x. Let A be the set of vertices an even distance from the 
root (including the root itself), and let B be the set of vertices at an odd distance from the root. 
We write x —?■ y to mean that y is a child of x in the tree. For each x, let dx be the degree of x as a 
vertex in an undirected graph. Thus, for all x ^ r, dx = \{y : x —>• y}| + 1; and dr = \{y : r -A y}|. 

The quantum walk operates on the Hilbert space R spanned by {|r)}U{|x) : x G {1, ..., T— 1}}, 
and starts in the state |r). Unlike many discrete-time quantum walk algorithms, it does not use a 
separate “coin” space. The walk is based on a set of diffusion operators Dx, where Dx acts on the 
subspace Rx spanned by {|x)} U {\y) : x -A y}. The diffusion operators are defined as follows: 

• If X is marked, then Dx is the identity. 

• If X is not marked, and x 7 ^ r, then Dx = I — 2\^l;x){'lpx\, where 

a;) + ) • 

y,x^y / 



• Dr = I — 2\'ijjr){'4’r\, where 


IV’r) = 


\/l + drH 


r) -|- \/n 


E 

y,r^y 


Observe that Dx can be implemented with only local knowledge, i.e. based only on whether x is 
marked and the neighbourhood structure of x. A step of the walk consists of applying the operator 
RbRa, where Ra = ®xeA^x and Rb = |?’)(?’| + ®x&B^x- An alternative way of viewing this 
process is as a quantum walk on the graph given by the edges of the tree, where we identify each 
vertex with the edge from its parent in the tree, and add an additional “input” edge into the root. 

The algorithm for detecting a marked vertex is presented as Algorithm 2. 

Lemma 6 (Special case of Belovs [ 8 ]). Algorithm 2 uses Ra and Rb 0{^jTn log(l/(5)) times. There 
exist universal constants (3, 7 such that it fails with probability at most 6. 
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Proof. The complexity bound is immediate from Theorem 4. For the correctness proof, we first 
show that, if there is a marked vertex, then \r) is quite close to (a normalised version of) an 
eigenvector \(p) of RbRa with eigenvalue 1. Let xq be a marked vertex and set 

\f)) = Vn\r)+ ( 1 ) 

x^r^x-^XQ 

Here x xq denotes the vertices x on the unique path from the root to xq, including xq itself. 
To see that \(j)) is invariant under RbRa, first note that \<fi) is orthogonal to all states lipx), where 
X ^ r and x is not marked. Indeed, any such state {'ijjx) either has uniform support on exactly 2 
consecutive vertices v in the path from r to xq, or is not supported on any vertices in this path. 
If)) is also orthogonal to IV’r) by direct calculation. We have 

|||(/))||^ = n + i{xo) < 2n. 


Thus 


lll<^)ll - ^/2■ 


Therefore, phase estimation returns the eigenvalue 1 with probability at least 1/2. On the other 
hand, if there are no marked vertices, we consider the vector 


\v) = k) + 1^;). 

Let Hyi and H^ be projectors onto the invariant subspaces of Ra and Rb- These spaces are spanned 
by vectors of the form Ifj^) for x G x G H respectively, where is orthogonal to \ipx) and has 
support only on {|x)} U {|y) : x —>• y}; in addition to |r) in the case of Rb- On each subspace Rx, 
x G A, \r]) is proportional to lipx), so n^|r/) = 0. Similarly Hsl??) = |r). By the effective spectral gap 
lemma (Lemma 3), ||H^|r)|| = ||P^nB|7y)|| < xlll^)ll < xVTn. For small enough y = fl{l/y/Tn), 
this is upper-bounded by 1/2. By Theorem 4, there exists 13 such that applying phase estimation 
to RbRa with precision 13/y/Tn returns the eigenvalue 1 with probability at most 1/4. 

Using a Chernoff bound, by repeating the subroutine 0(logl/5) times and returning “marked 
vertex exists” if the fraction of acceptances is greater than 3/8, and “no marked vertex” otherwise, 
we obtain that the overall algorithm fails with probability at most 5. □ 


2.1 Finding a marked vertex 

From now on, we assume that the degree of every vertex in the tree is 0(1); this is not a significant 
restriction for the application to backtracking. For trees obeying this restriction we can use the 
detection algorithm as a subroutine to find a marked vertex efficiently, via binary search. 

To find a marked vertex, we start by applying Algorithm 2 to the entire tree. If it outputs 
“marked vertex exists”, we apply the algorithm to the subtrees rooted at each child of the root 
in turn, to detect marked vertices within each subtree. Assuming the algorithm did not fail at 
any point, there must be a marked vertex in at least one subtree. We pick the root of one such 
subtree and check whether it is marked. If it is marked, we output its label and terminate; if it is 
not marked, we apply Algorithm 2 to each of its children and repeat. This process continues until 
we have found a marked vertex. As there are at most 0(n) repetitions to reach a leaf and 0(1) 
subtrees are checked at each repetition, the time complexity of the algorithm is multiplied by a 
factor of 0(n). Note that, when we apply the algorithm to subtrees, we must leave the parameter 



T unchanged; this is because the tree could be quite unbalanced, and a given subtree could contain 
many vertices. 

We have thus far assumed that we know an upper bound on T in advance. If we do not, we 
can repeat the whole search algorithm O(logr) = 0{n) times, doubling a guess for T each time 
(starting with T = 1 ) until we either find a marked vertex, or the algorithm returns “no marked 
vertex”. This exponential doubling does not affect the asymptotic runtime. If our guess for T is 
too low, the correctness proof of Algorithm 2 no longer holds, so the detection algorithm may claim 
that there is a marked vertex in a situation where there is actually no marked vertex. This may 
lead to the above binary search procedure returning an incorrect result. But we can deal with this 
situation by checking the final vertex returned by the search algorithm, and only terminating if it 
is marked; if it is not, we know that the search has failed, and continue doubling our guess for T. 
On the other hand, one can see from inspecting the proof of Lemma 6 that, if there is a marked 
vertex, the phase estimation subroutine in Algorithm 2 will accept with probability at least 1/2 
whether or not our guess for T is large enough. Therefore, if there is a marked vertex. Algorithm 
2 will output that a marked vertex exists with probability at least 1 — d, for 6 of our choice. 

Using this procedure the total number of uses of Algorithm 2 (with differing values of T) is 
O(n^), so in order for the whole algorithm to succeed with probability, say, 2/3, it is sufficient to 
reduce the failure probability of each use of Algorithm 2 to 0(l/n^). This costs an additional time 
factor of O(logn) per use of the algorithm, giving a total runtime of 0{y/Tn^^‘^ logn). This can in 
turn be improved to an arbitrary failure probability 5 > 0 by taking 0 (log l/d) repetitions, leading 
to an overall bound of time 0 (\/Tn^/^ lognlog(l/(f)). 

Finally, we can find all marked vertices by simply repeating the algorithm, modifying the 
underlying oracle operator to strike out previously seen marked elements. If there are k marked 
elements, the overall runtime is 0{k\/Tn^^‘^ \ognlog{k/6)). 


2.2 Search with a unique marked element 


If we are promised that there exists a unique marked element in the tree, we can improve the above 
bounds by a factor of almost n. In general this improvement is not particularly large, as we usually 
have T ^ n; however, for some “tall and thin” trees it can be relatively significant. In particular, 
following this improvement we see that the complexity of the quantum algorithm for the search 
problem is never worse than the classical complexity 0{T), up to logarithmic factors. 

We assume that there is a unique marked vertex xq and that i{xo) = n. This second assumption 
is without loss of generality. We can determine (.{xq) at the start of the algorithm by applying 
Algorithm 2 to the subtree rooted at r and of depth i, for differing values of i. That is, we only 
expand the tree up to depth i, and use binary search on f G {1 ,..., n} to find the minimal i such 
that the tree of depth i contains xq. This needs O(logn) repetitions, so the complexity of this 
part is 0{^/Tn logn log logn), where the log log term comes from reducing the failure probability 
of Algorithm 2 to 0(l/(logn)). Once £(xo) is determined, we henceforth only search within the 
tree of depth £(xo)- 

Let |i?i') = |</>)/|||</>)||, where the eigenvector |0) is defined in (1), i.e. 


W) 





x^r^x-^XQ 


The starting point for the search algorithm is the observation^ that |(/') encodes the entire path 
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A similar observation was used in [53] to approximate effective resistances. 
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from r to xq. If we measure \4>'), and do not receive outcome r, we receive a measurement outcome 
y which is uniformly distributed on the path from r to xq. We can then repeat the algorithm on 
the subtree rooted at y, obtaining a new state of the form of |(/)') for a smaller value of n. The 
expected number of measurements we would need to make to find xq is logarithmic in n (rather 
than the bound of 0(n) which follows from the previous binary search algorithm). 

We first bound the total number of quantum walk steps used to find xq, given access to states 
of the form of for various subtrees. Let C = 0(l/logn) be chosen such that Algorithm 2 fails 
with probability at most l/(4n) and uses at most Cy/Tn steps. Given that £(xo) = n, measuring 
a copy of 10') will give a “good” outcome (which is not r) with probability 1/2. The distance from 
the root of such an outcome is uniformly distributed. Considering only the good outcomes, the 
expected total number of steps Sn to find xq, given that I'(xo) = n, therefore satisfies 

- n—1 

Sn<-^ Si + CVf^. 

n 

i=0 

We claim that Sn = 0{CyjTn). The proof is by induction. First, 5o = 0 as no quantum walk steps 
are made. Assume Si < 4C\/T^ for all i < n. Then 


AC 


n—1 


Sn< — V Vfi + CV¥n < AC 
n 


j=0 


\ 


1 ” ^ AC 

- V Ti + CViM = + C\/f^ 

n ^ a/9 




where the second inequality is Jensen’s inequality. As on average half the outcomes are good, the 
expected total number of steps is thus 0{y/Tn logn). 


We can approximately produce |0') by applying phase estimation to the operator RbRa, with 
input state |r). If we write 


^/2' 


^/2' 


where is normalised and orthogonal to |0), the result of applying phase estimation on |r) with 
s ancilla qubits is a state of the form 


1 

Vi 


IV)|o*) + 


1 

Vi 


^ Afc|0fc)|a;fc), 

k,0k>0 


where {ipk) is an eigenvector of RbRa with eigenvalue Write each \ujk) as \u)k) = yfc|0^) + |w(,) 

for some subnormalised vectors Iw/) orthogonal to |0®). If we obtain outcome |0®) when we measure 
the second register, the first register collapses to 


IV) = , IV) + ^ki^ki'ipk) ■ 

To bound the distance between |0') and the desired state |0'), we split the sum into two parts. For 
any e > 0, via Theorem 4 we have 

k,0k>e k,9k>e 

On the other hand, we prove the following technical claim in Appendix A. Recall that is the 
projector onto the span of the eigenvectors of RbRa with eigenvalues such that |0| < e. 
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Lemma 7. ||Pe|i;/)-*-)|| = 0{eVTn). 

Given Lemma 7, we have 

\Xkfik\^< \>^k\^ = \\Pe\^^)f = Oie^Tn). 

k,0<9f.<e k,0<9k<e 

Fixing an accuracy ^and taking e = Q{5/y/Tn), 2^ = 0{-\/Tn/5^)^ we have |||</>') —II = 0(5). By 
Fact 5, measuring |(/)') in the computational basis is indistinguishable from measuring |(/)'), except 
with probability 0(5). If we take 5 = 0(1/log n), the algorithm does not notice the difference on 
any of the O(logn) states used, with probability n(l). The overall complexity of the algorithm 
is therefore 0{VTn log^ n)^. As before, the failure probability can be made arbitrarily small via 
repetition. 

In Section 5 we discuss some barriers to improving the complexity and applicability of these 
algorithms. 


3 From quantum walks on trees to accelerating backtracking 

To complete the proofs of Theorems 1 and 2, we now verify that Algorithm 2 can be applied to 
search in the tree defined by a backtracking algorithm. In order to do this, it is sufficient to define 
a suitable efficient mapping between partial assignments and vertices in a tree, and to implement 
the operators Ra and Rb appropriately and efficiently. As the quantum walk subroutines assume 
that the root of the tree is not marked, the first step of the algorithm is to check whether P{*^) 
is true. If so, the algorithm immediately returns “true”; if not, it runs Algorithm 2 on a graph 
defined as follows. 

The current state of the backtracking algorithm is represented by a vertex in a rooted tree 
labelled with a sequence of the form (ii, ui),..., (i^, u^), for 1 < i < n. The sequence corresponds to 
a partial assignment x G H where we assign Xi^. = Vk for k = and Xj = * for all other indices 

j. The tree only contains vertices corresponding to valid partial assignments. Each vertex except for 
the root (which is labelled with the empty sequence) is connected to its parent, the vertex labelled 
with (ii, ui),..., (if_i, U£_i). It is also connected to all vertices of the form (ii, ui),..., (i^, u^), (j, w), 
where j = /i((ii, ui),..., (z£, ur)), w G [d], and P((zi, ui),..., (i^, ur), (j, re)) is not false. That is, 
all vertices corresponding to valid partial assignments which extend the current partial assigment 
by assigning a value to the variable whose index is given by /i. It is convenient to assume that 
the predicate P and the heuristic h take as input a string of (index, value) pairs which describe 
value assignments to variables, rather than an element of P; if not, converting between these 
representations can be done in time 0{n). We will also assume that, for all complete assignments, 
the predicate returns either true or false (as it should do). 

The algorithm takes place within the Hilbert space ® (C"'+^ ® (J 2 ;d+i^®n together 

with an ancilla space. Each basis vector within 77^”'^ represents a partial assignment described by 
a sequence as above. The first register stores a level i between 0 and n, representing the length 
of the sequence (the number of non-=t=’s in the assignment). Each of the next i registers stores a 
pair {ik,Vk) giving the index of a variable (an integer between 1 and n) and the assignment to that 
variable (an integer between 0 and d — 1). Except during updates to the state, the remaining n — i 
registers all contain the pair (0, *). The algorithm can easily be modified to use qubits if desired, 

^One way to improve the polylogarithmic factors in this complexity could be to reweight the tree such that the 
eigenvector of RbRa with eigenvalue 1 has more weight on xo (Alexander Belov, personal communication). 


11 



Input: A basis state |£)|(ii, vi))... |(in,G corresponding to a partial assignment 
Xjj = t>i,..., = V(,. Ancilla registers Uanc, ^next, ^children, storing a tuple (a, j, 5), where 

a G {*} U [d], j G {0,...,n}, 5 C [d], initialised to a = *, j = 0, 5 = 0. 

1. If P{x) is true, return. 

2. If£ is odd, subtract h{{ii,vi ),..., V£-i)) from ig and swap a with vg. 

3. If a / *, subtract 1 from £. (Now i is even and {ig+i,vg-\.i) = (0, *).) 

4. Add /i((ii,ui),..., {i£,vg)) to j. 

5. For each w G [d]: 

(a) If vi ),..., {ig, Vg), {j, w)) is not false, set 5 = S' U {tc}. 

6. If = 0, perform the operation I — 2\(f>n,s){4’n,s\ on P^nc- Otherwise, perform the 
operation / - 2\(l)i^s){4‘i,s\ on Pane- 

7. Uncompute S and j by reversing steps 5 and 4. 

8. If a 7^ *, add 1 to i. If i is now odd, add h{{ii,vi ),..., {ig-i,vg-i)) to ig and swap vg 
with a. (Now a = * again.) 

Algorithm 3: Implementation of the operator 


rather than systems with dimension n +1 and d+1, by encoding each subsystem in 0(logn + logd) 
qubits. 


Let Ua,s, for S C [d] and a G M, act on with basis {|*), |0),..., |d — 1)} by mapping 

I*) where 

1 


10 . 


‘a,S) ■ = 


-^a|S| + 


= I*)+I* 

1V h 


We assume that, for any subset S C [d] and any fixed a G M, we can perform Ua,s and its inverse 
in time 0(1) each. (Dependent on the gate set being used, we may not be able to implement Ua,s 
exactly. However, for any universal gate set we can implement it up to accuracy 1 — e in time 
poly log(l/e); this will multiply the runtime of the overall algorithm by at most a polylogarithmic 
factor.) By applying Ua,s and its inverse we can perform the operation I — 2\4>a^s){4‘a,s\- 

In order to use Algorithm 2, we need to implement the operators Ra and Rb- The implemen¬ 
tation of Ra using I — 2\(f>a,s){4'a,s\j P and h is described in Algorithm 3 above. Rb is similar, 
except that: step 1 is replaced with the check “If P{x) is true or ^ = 0, return”; “odd” is replaced 
with “even” in steps 2 and 8; and the check “If i = 0” is removed from step 6. The first of these 
changes is because Rb should leave the root of the tree invariant; and the last is because i is always 
odd at that point in the modified algorithm, so the check is unnecessary. 

We now argue that Algorithm 3 correctly implements Ra- Write x = {ii,vi ),..., {ig, vg) for the 
partial assignment passed to the algorithm, and write x' = (fi, ui),..., (i^-i, vg^i) for the parent 
partial assignment in the tree. The goal of the algorithm is to implement the operator Dx 

defined in Section 2. For each x G A, Dx only acts on the subspace corresponding to x and 
its children. To implement Dx, it is therefore sufficient to map the basis state corresponding to 
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(ii, ui),..., {ii, v^), and all the basis states corresponding to (zi, fi),..., (z^, Vi), {j, w) for w £ [d], 
where j = h{{ii,vi ),..., {ii,Vi)) and £ is even, to a (d + l)-dimensional subspace on which the 
children of x can be mixed over using Ua^Sj then returning to the original subspace. This is 
precisely what Algorithm 3 does. 

In more detail, the algorithm performs the following steps. First, it does nothing when x is 
marked, corresponding to the definition of Dx- If x is not marked, the behaviour depends on 
whether i is even (corresponding to x £ A) or i is odd (corresponding to x £ B). Define y by 
setting y = X X £ A, and y = x' x £ B. Then the algorithm implements an inversion about 
\'4^y)., which is split into three subparts; 

• Steps 2-3: Perform a map of the form |x) i—)■ |z/)|*) for x £ A, and |a:) i—)• \y)\w) for x £ B, 
where w is the value of x at the h(x')’th position, i.e. the most recent variable assignment 
that was made by the backtracking algorithm. 

• Steps 4-5; Determine the children of y. 

• Step 6: Perform the operation I — 2\^jJy){^py\ using the knowledge of the children of y. 

• Steps 7-8; Uncompute junk and reverse the first map. 

It can be verified that the algorithm implements the desired behaviour for all basis state inputs 
\£)\{ii-,vi)) .. .\{in,Vn)) such that (zi, ui),..., (z^, is a valid path in the backtracking tree; we 
omit the routine details. As the algorithm implements the operation Dx unitarily for 

all basis states |x), it also implements Ra correctly for all superpositions of basis states. Together 
with the similar implementation of this is enough to implement Algorithm 2. For each use of 
Ra and Rb the algorithm uses 0(1) auxiliary operations as claimed. 


4 Prom quadratic speedups to exponential speedups 

In this section we show that it is possible to leverage the speedup achieved by the quantum back¬ 
tracking algorithm to obtain much more significant speedups over classical algorithms - but in a 
non-standard, average-case setting. 

For any (classical or quantum) algorithm A, let Ta{X) denote the expected runtime of A on 
input X. Let V he a distribution on inputs X. Imagine we have a quantum algorithm Q and 
a classical algorithm C such that Tq{X) ~ \JTc{X) for all X. This is the case for the quantum 
algorithms presented here, where for CSPs on n variables we have Tq{X) < ■\jTc{X) poly(n). 
Then, by Jensen’s inequality, we have 

¥.xMTq{X)] < ^^xMTQiXf] < ^^xMTc{X)]. 

However, dependent on the distribution "P, taking the average in this way can sometimes amplify 
the separation to become much greater than quadratic, and even sometimes exponential or super¬ 
exponential. This point was noted in the context of quantum query complexity by Ambainis and 
de Wolf [4] , who gave several examples of super-polynomial average-case quantum speedups for the 
computation of total functions, and later by Montanaro [44], who showed that even the unstructured 
search problem with a unique marked element, with power-law distributions on the position of this 
marked element, can display this behaviour. 

One very simple example of this phenomenon is the following separation. Let C be a classical 
algorithm for Circuit SAT. Assume that, for each integer n, there exists an instance of Circuit SAT 
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on n variables such that C has runtime n(2"') (this is the case for the best classical algorithms at 
present [54]). Also let Q be a quantum algorithm which solves Circuit SAT using Grover’s algorithm, 
using time 0(2”/^ poly(n)) on an input of size n. Finally, let Vn be the following distribution on 
instances with n variables: with probability p, return a hard instance of size n; with probability 
1 — p, return a trivial instance. Then 

Ex^pjTciX)] = n{p2^), Ex^vJTq{X)] = 0(p2-/2 poly(n)). 

If we take p = the separation between these two quantities is exponential. 

However, this is clearly a rather contrived distribution on the inputs. One might hope to find 
some problem, together with a more natural distribution on the inputs, which allows a similar 
exponential separation to be proven. The quantum backtracking algorithm allows one to find 
separations of this form, given a backtracking algorithm with a suitable distribution of runtimes. 
Indeed, imagine we have a family of CSPs and a distribution Vn on problems of size n such that 
with high probability the problem has 0(1) solutions. Further imagine that we have a deterministic 
classical backtracking algorithm whose backtracking tree contains T{X) vertices on input X, such 
that Ptp^[T(X) = t] < Ctl^ for all t and some constants C and /3. In addition, assume that 
Prp^[T(A) = t] > Dt^, for some constant D, for M different values t. Here M is some large integer 
which we think of as being exponentially large in n. Then 

M 

Ex^vAT{X)] = f^(Ml^+2). 

i=l 

For /3 > —2, this quantity is exponentially large. However, if —2 < (3 < —3/2, the quantum 
backtracking algorithm described above uses an average of 

Ex~pJO(V7TOpoly(n))] < E 0{Vt ■ poly(n)) = poly(n) 

t>i 

quantum walk steps. If each step requires time poly(n) (to evaluate the predicate P and the 
heuristic h) we have obtained an exponential reduction in expected runtime. 

We therefore see that a “power law” tail of the distribution Vn of the form pt = Ptp^\T{X) = 
t] ~ for a suitable value of jS, gives us an exponential separation. There is substantial empirical 
evidence, and some analytical evidence, that such power law, or “heavy”, tails can occur in both 
random and real-world instances of CSPs; for a survey, see [33]. For example, consider the case of 
graph /c-colouring on random graphs with n vertices, where each edge is present with probability 
©(1/n). Hogg and Williams [35] observed that a natural backtracking algorithm seemed to have a 
power-law distribution of its runtimes. Later work by Jia and Moore [36] provided some analytical 
justification for this, and additional experiments, which together suggest that for 3-colouring the 
distribution is of the form pt ~ t~^. 

However, there are several reasons why it is unclear that this phenomenon could lead to expo¬ 
nential separations between quantum and classical expected runtimes. First, there is some evidence 
that some apparently heavy-tailed behaviour may in fact be due to finite-size effects [20, 21]. Sec¬ 
ond, one reason for a skewed runtime distribution could be that, on satisfiable instances, the 
backtracking algorithm sometimes gets lucky and happens to find a satisfying assignment early on, 
after which it terminates. The runtime of the quantum algorithm described here depends on the 
size of the whole tree and hence will not correspond to the square root of the classical runtime in 
this case. Indeed, runtime distributions on unsatisfiable instances do not seem to display the same 
heavy-tailed behaviour [46, 31]. 
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Third, in many cases power-law behaviour is observed when a randomised backtracking algo¬ 
rithm is run on a single instance. That is, when the choices of branching variables made by the 
algorithm are random and we consider the distribution of the runtimes T{r) over the choice of 
random seed r. Algorithmic randomness of this form (as opposed to picking the input instance at 
random) is not suitable for obtaining an exponential quantum-classical separation using the quan¬ 
tum backtracking algorithm. This is because, if the quantum backtracking algorithm’s expected 
runtime over r is at most R, for some R, we have R = n(Er[y^r(r)]). So, by Markov’s inequality, 
T(r) = 0{R^) with, say, 99% probability. Therefore, if we stop the classical algorithm after time 
0{R^), it will succeed with probability n(l). 

For these reasons, we consider random instances of CSPs produced not just by using a fixed 
density of constraints, but by taking a distribution over different constraint densities. This enables 
us to find relatively natural input distributions under which the expected runtime of the quantum 
backtracking algorithm is exponentially faster. 


4.1 Expected runtime bounds 

There is now a substantial body of work proving bounds on the expected runtime of DPLL-type 
algorithms for random /c-SAT. For example, consider k = 3 and instances consisting of m = an 
uniformly random clauses. For a > 4.3, there is strong evidence that such instances very rarely 
have a solution [48], so the task of the algorithm is usually to prove unsatisfiability. Beame et al. [6] 
have shown that, for a simple DPLL variant (known as ordered DLL) the runtime is with 

probability 1 — o(l). Cocco and Monasson [18, 19] have used statistical physics techniques to even 
determine (non-rigorously) the constant in the exponent. In particular, they argue that, for large 
a, the runtime is approximately 

Sometimes one can prove such tight bounds rigorously. For example, consider the following very 
simple backtracking algorithm, which fits within the framework of Algorithm 1. Fix an ordering 
of the variables from 1 to n. Then the heuristic h returns the lowest index of a variable which has 
not yet been assigned a value. Call this algorithm NaiveBt. Then the following result holds: 

Proposition 8. The expected number of vertices E in the backtracking tree of the NaiveBt algorithm 
when applied to a random k-SAT instance on n variables, with m = an uniformly random clauses, 
for 1 < a < n^~^, satisfies 

2^'^ <E< 0{n2^^), 

where C and C depend only on a and k. For k = 3, C < 0.907/v^, C > 0.906/-y/a — 0.142/a^. 
Proof. See Appendix B. □ 


Similar analyses to Proposition 8 have been carried out many times in the literature, albeit 
often for slightly different models (e.g. [13, 45]). 

Let Cn denote the expected runtime of the NaiveBt algorithm applied to 3-SAT instances on n 
variables, where the expectation is taken with respect to a distribution over numbers of constraints 
m. If the probability that we have m constraints is pm, then by Proposition 8 


Cn> E p 2’^(0-906-^n/m—0.142 (n/m)^) 


On the other hand, up to a poly(n) factor, on any instance the runtime of the quantum backtracking 
algorithm is at most the square root of the classical one, multiplied by the number of solutions (if 
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there are any). We consider a distribution pm which is only supported on values of m such that 
the probability that a random CSP with m constraints has a solution is 0(2“”'). For such values 
of m, we can ignore the additional cost for finding all the solutions. For 3-SAT, this is true for 
m > 16n/(ln2), for example (see Appendix B). 

Letting Qn denote the expected runtime of the quantum algorithm applied to NaiVeBt, we 
therefore have 

Qn < 

m 

Consider the distribution pm oc for 16n/(ln2) < m < n^. Then 


Qn = poly(n), Cn = ^{ 


2(0-906-0.454)n3/2/^-0.142(n/m)2^ _ j2('20.094n^ 

m> 16 n/(ln 2 ) 


where the asymptotic bound follows from inserting m = { 16n/(ln 2)]. We therefore have an average- 
case exponential separation between the quantum and classical complexities of 3-SAT under this 
distribution; and, indeed, for various other distributions of the form pm oc While 

this family of distributions is arguably less contrived than the Circuit SAT example given above 
(for example, the number of variables is fixed; only the number of clauses varies), it still appears 
somewhat unnatural. It seems to be an interesting question to determine more natural input 
distributions which also lead to exponential quantum speedups. 


5 Improving the quantum walk algorithm? 

We finish by addressing the question of how tight the bounds are which we have obtained on 
quantum search in trees. It is clear that, given a tree with T vertices, we must have a lower bound 
of the form Q.{-\/T) for finding a marked vertex (otherwise, we could use the algorithm to solve the 
unstructured search problem on T elements using o{VT) quantum queries, which is impossible [10]). 
There are several plausible ways in which the complexity of the algorithm presented here could be 
improved to get closer to this bound. However, there appear to be some challenges to doing so in 
each of these cases. 

1. Reduction of the dependence on the depth n. It is easy to see that, if we would like to apply 
the quantum backtracking algorithm to general trees, there must be some dependence on 
the depth in the runtime. Indeed, consider a path on T vertices, which has depth T — 1. 
Then, if the marked vertex is the last one in the path, we require n(r) steps to find it. More 
generally, it was shown by Aaronson and Ambainis [1] that for each pair T and n, there is 
a tree containing T vertices and with depth 0{n) such that determining the existence of a 
marked vertex requires Q{VTn) queries. This holds even if we know the tree in advance and 
are allowed to perform arbitrary “local” operations to search within it. 

2. Reduction of the overhead for searching with multiple marked vertices. It would be interesting 
to determine whether the search algorithm in Section 2.2 could be generalised to work with a 
similar efficiency for an arbitrary number of marked vertices. The question of when one can 
convert a quantum walk speedup for detecting a marked element to a speedup for finding a 
marked element has been studied previously. But while it was shown by Szegedy [51] that 
the time to detect a marked element using a quantum walk is at most the square root of the 
classical hitting time, it is not known whether the time to hnd a marked element has the 
same scaling in general. 
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Indeed, Krovi et al. [39] have described a way (generalising previous results of [52, 42]) to 
modify the original quantum walk approach of Szegedy to obtain a quadratic speedup for the 
search problem in the case where there is a unique marked element. However, if there is more 
than one marked element, the runtime of their algorithm scales with a quantity they call 
the extended hitting time, which may be larger than the hitting time. In any case, all these 
algorithms assume that the graph is known in advance and the initial state of the quantum 
walk algorithm corresponds to the stationary distribution of the random walk. Neither of 
these assumptions applies here. 

3. Reduction of the dependence on k to find one, or all, of k marked vertices. For the unstruc¬ 
tured search problem with k marked elements out of T, Grover’s algorithm can find a marked 
element using 0{\/T/k) queries, which implies an algorithm which finds all marked elements 
in 0{VTk) queries. It would be natural to hope for a bound of a similar form for quantum 
search on trees, e.g. O^sjTn/k') to find a marked vertex and 0{VTnk) to find all k of them. 
Unfortunately, it is far from clear that this can be achieved. 

Indeed, consider the following argument due to Alexander Belov. Imagine we have access to 
an algorithm A which finds one of A: > 1 marked vertices using o{^/Tn) queries, and consider 
an arbitrary tree containing one marked leaf (.q. Modify the tree by attaching a subtree of 
depth 0{\ogk) below that leaf containing k vertices, all of which are marked and are labelled 
such that io can be determined from their labels. Then, using A, we can find one of these 
vertices using o{VTn) queries. Finding such a vertex enables us to find io with no additional 
queries, contradicting the aforementioned ^{y/Tn) lower bound [1]. However, this argument 
does not rule out the possibility that some other approach could find all k marked vertices 
in, for example, 0{VTnk) time. 

One other way in which it might be possible to improve the quantum backtracking algorithm is in 
situations where the classical backtracking algorithm is lucky and finds a solution without exploring 
the whole tree. For such instances the quantum algorithm, which is forced to explore the whole 
tree, may not outperform the classical algorithm. It might be possible to improve the performance 
of the quantum algorithm in this situation by biasing it to prefer to explore the parts of the tree 
visited by the classical algorithm earlier on. 
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A Proof of technical claim for search with one marked element 

In this appendix, we prove the following claim from Section 2.2: 

Lemma 7 (restated). II = 0{x's/Tn). 

Let xq be the unique marked vertex, assuming for simplicity in the proof (as justified in Section 
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2.2) that (.{ xq ) = n, and hence that xq is a leaf in the tree. We can write 




V2\r) - 10') = V2\r) - 




V^\r)+ (- 1 )'^"^! 

x^r,x-^xo 




x^r,X'^XQ 


Recall that Ilyi and IIb are projectors onto the invariant subspaces of Ra and Rb- The invariant 
subspace of Ra is spanned by vectors of the form |0^) for each vertex x ^ A, and if xq G A, in 
addition the vector The invariant subspace of is similar (replacing A with B) but also 

contains |r). Here ) = 0 and 10^) has support only on {|x)} U {|y) : x —)• y}. In order 

to apply the effective spectral gap lemma, we determine a vector |^) such that H^il^) = 0 and 

nslO = 

First assume xq G B. We will take |^) to be a linear combination of vectors |0a;) for x € A. 
Then the first of these two constraints is immediately satisfied. The second will be satished if, for 
a set of vectors |C) which span the invariant subspace of Rb, i-e. 

10 e {k),IV’xo)}U{|'0i-) : i'lp^l'ipx) =0 ,xGB}, 

we have (CIO = (CIC’"*')- To compute the required inner products, first observe that l^Px) 
support on x and its children, so for all x not on the path from r to xq, {'4>x\4’'^) — 0 - f^ie other 
hand, for each x £ B such that x ^ xq, dehne a basis for the space span{|0;;!-) : (^O'C'a;) = 0} by 
hxing the vectors 

= - 0 ) + (dx - l)|iVi(a:)) - 

where iVj(x) denotes the z’th child of x, recalling that dx denotes the degree of x. We have 




—dx/V^ if f = zq 
0 otherwise 


( 2 ) 


where zq denotes the unique child of x on the path to xq. 

We now find a vector |C) = Ylix^^\^) satisfying the above constraints. First, we require = 
(r|C) = (z’|0“'“) = 1/^/2 and Uxq = (xol^"*") = For each x, let x' denote the parent of x in 

the tree. For |C) to be a linear combination of vectors \'ipx), x G A, it is necessary and sufficient 
that Ux = Oix' for all x G R; except in the case i{x) = 1, where we require Ux = y/nur- We in 
addition need ax = ax' for all x 7 ^ r G A such that x' is not on the path to xq, in order that 
(0,^/1^) = {'4’x'\^'^) ~ b- For each child y of x G R, set = 7 if y 7 ^- xq, and = <5 if y xq. 
Then from (2) we have the final constraints that 


-ax + 27 - (i = 0 if y 7 A xo. 


-ax - {dx - 2)7 + {dx -1)6 = - 

These equations have unique solution j = ax — 1/6 
all coefficients ax- In particular, observe that 


dx -r 

-7= if y Xq. 

v 2 n 

= ax — 7 / 270 . This now uniquely dehnes 


axo — 



n — 1 


2 



1 
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as required. 

This constructs |^) in the case where xq € B. If instead xq G A, the procedure is similar. 
Now \'iPxq) is not in the invariant subspace of Rb (which only makes it easier to satisfy the inner 
product constraints), but also |.^) must be a linear combination of vectors |V’x) corresponding only 
to unmarked vertices x £ A. This new constraint implies that now = 0. But following the 
above procedure now gives 



as required. In either case, for all x, we have \ax\ < So |||^)|| = 0{\/Tn) and hence, by the 

effective spectral gap lemma (Lemma 3), ||T((.|(^-*-)|| = 0{xy/Tn). 


B The runtime of the NaiveBt backtracking algorithm for k-SAT 


Here we find simple, yet fairly precise, bounds on the expected number of vertices in the backtrack¬ 
ing tree for the NaiveBt algorithm when applied to random A:-SAT with k = 0(1). 

Assume we pick a random instance of fe-SAT by choosing m = an clauses, for some a such that 
1 < a < . Each clause contains k distinct variables, and each variable can be present negated 

or unnegated. Each clause is chosen uniformly at random, with replacement, from the set of all 
2^(^) allowed such clauses. Recall that the NaiveBt algorithm is a backtracking algorithm in the 
framework of Algorithm 1 where the heuristic h simply picks the lowest index of a variable which 
has not yet been assigned a value. 

The probability that a given assignment to variables xi,..., is consistent with all the clauses 
in such a random instance is 


(1 — Pr[clause only depends on first ^ variables] Pr[assignment fails to satisfy clause])™ 

i 2^^; ■ 

The expected number of solutions is 

In2)o)n 

SO for a ;:g> 2^/In 2, the probability that the instance is satisfiable is exponentially small in n. The 
expected number of vertices in the backtracking tree is 




1=0 


AL 


We have 


max 2 1 — 


TL 

2‘0 


< E < {n + 1) max 2^ 1 — 


AL 


Eor the upper bound, using 1 — x < e ^ we obtain 

\ m 


max 2 1 — 


Ah 

2‘0 


f _n-2~^yf fc+1) — u‘2 /p\ , , 

<max2^e D--("-fe+i) < max2^e""^ ^(i-^ A). (3 

i £ 
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By taking the derivative over ^ we get that the maximum is achieved for ^ such that 


A:(A: - ^ 

The left-hand side is of the form — 0(l/f)) = I — 0(1), so we can ignore the 0(1/1') correction 
term as it can only change the final bound by an 0(1) factor. Rounding I to the nearest integer 
similarly does not change the asymptotic complexity. So, inserting the right-hand side in (3) and 
simplifying, we obtain 


E = 0(„2C>‘), where C = ( —) (l - -) . 

On the other hand, inserting this value of i into the lower bound and using the inequality 1 — a: > 

_ X _ 

^ X X gjygg 

E = = 0 ( 2 ^'-), 

where 

, _ /2^1n2y''^^“^Y 1 ln2 /2^1n2y''^^“^^\ 

\ ak j Y A: \ ak j J 

For example, for k = 3, we have C < 0.907/vTi, C > O.OOG/yTt — 0.142/a2. 
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